TR - 2011 - 004 m - privacy for collaborative data publishing

نویسندگان

  • Slawomir Goryczka
  • Li Xiong
  • Benjamin C. M. Fung
چکیده

In this paper, we consider the collaborative data publishingproblem for anonymizing horizontally partitioned data atmultiple data providers. We consider a new type of “in-sider attack” by colluding data providers who may use theirown data records (a subset of the overall data) in addition tothe external background knowledge to infer the data recordscontributed by other data providers. The paper addressesthis new threat and makes several contributions. First, weintroduce the notion ofm-privacy, which guarantees that theanonymized data satisfies a given privacy constraint againstany group of up to m colluding data providers. Second, wepresent heuristic algorithms exploiting the EG monotonicityof privacy constraints, and adaptive ordering techniques forefficiently checking m-privacy given a group of records. Forremaining privacy constraints we present a verification algo-rithm with the minimal number of privacy checks. Third,we present a data provider-aware anonymization algorithmwith adaptive m-privacy checking strategies to ensure highutility, and m-privacy of anonymized data with efficiency.Finally, we implement all algorithms (verification and ano-nymization) for settings with a trusted third party and in-troduce secure computation protocols for scenarios withoutsuch party. All protocols are extensively analyzed, their se-curity and efficiency is formally proved. Experiments onreal-life datasets suggest that our approach achieves betteror comparable utility and efficiency than existing and baseline algorithms while satisfying m-privacy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Privacy Collaborative Data publishing with Efficient Anonymization Techniques

Privacy-preserving in collaborative data publishing provides methods and tools for publishing the data while protecting the sensitive information in the data set. The success of data mining in privacy relies on the information sharing and quality of data in a distributed environment. Several anonymization techniques have been proposed such as bucketization, generalization which does not prevent...

متن کامل

Secure Distributed Data Anonymization and Integration with m-Privacy

In this paper, we study the collaborative data publishing problem for anonymizing horizontally partitioned data at multiple data providers. We consider a new type of “insider attack” by colluding data providers who may use their own data records (a subset of the overall data) to infer the data records contributed by other data providers. The paper addresses this new threat, and makes several co...

متن کامل

Anonymization for Secure Data in Web

In this paper, we consider the collaborative data publishing problem for anonymizing horizontally partitioned data at multiple data providers. We consider a new type of " insider attack " by colluding data providers who may use their own data records (a subset of the overall data) to infer the data records contributed by other data providers. The paper addresses this new threat, and makes sever...

متن کامل

ارایه یک روش جدید انتشار داده‌ها با حفظ محرمانگی با هدف بهبود دقّت طبقه‌‌بندی روی داده‌های گمنام

Data collection and storage has been facilitated by the growth in electronic services, and has led to recording vast amounts of personal information in public and private organizations databases. These records often include sensitive personal information (such as income and diseases) and must be covered from others access. But in some cases, mining the data and extraction of knowledge from thes...

متن کامل

Secure Multi-party Computation Protocols For Collaborative Data Publishing With m-Privacy

In this paper collaborative data publishing setting with horizontally partitioned data across multiple data providers, in additional bag round knowledge of each contributing a subset of records . As a special case, a data provider could be the data owner itself who is contributing its own records. This is a very common scenario in social networking and recommendation systems. In this paper we i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012